Retrieving BioMedical Information: Challenges and Possibilities
نویسنده
چکیده
A large amount of biomedical information is available to researchers today, and it is continuously increasing. As a result, researchers widely agree that the ability to precisely retrieve desired information is vital to use the available knowledge. A way to achieve this is providing a retrieval system that is not only able to retrieve the available and sought information, but also to filter out irrelevant documents, while giving the relevant ones the highest ranking. The main goal of this work has been to investigate how to improve the ability for a system to find and rank relevant documents. Our method is based on applying series of information retrieval techniques to search in biomedical information and combine them in an optimal manner. These techniques include extending and using well-established information retrieval (IR) similarity models like the TF-IDF and BM25 as the scoring schemes, and applying personalisation so that researchers may affect the ranking based on their view of relevance. The techniques have been implemented and tested in a proof-of-concept prototype called BioTracer, extending a Java-based open source search engine library. The preliminary results from our experiments using the TREC 2004 Genomic Track collection seem satisfactory, with the best mean average precision (MAP) of 0.5129 and the best precision at 100 retrieved documents (P@100) of 0.473. What can be concluded from these results is that involving the users in the search will often have positive effects on the ranking of search results, and that our BioTracer system represents a tool that may be able to meet the user’s information needs.
منابع مشابه
Web Crawling Agents for Retrieving Biomedical Information
Autonomous agents for topic driven retrieval of information from the Web are currently a very active area of research. The ability to conduct real time searches for information is important for many users including biomedical scientists, health care professionals and the general public. We present preliminary research on different retrieval agents tested on their ability to retrieve biomedical ...
متن کاملComparison of Bibliographic Databases in Retrieving Information on Telemedicine
Background & Aims: Some of the main questions which can be of importance for those researchers who intend to perform a systematic review in a field of science are: ‘What databases should I use for my review?’; ‘Do all these databases have the same value?’; and ‘Which sourcesretrieved the highest of relevant references?’. The main aim of this work was the identification of the best database for ...
متن کاملAn automatic method for retrieving and indexing catalogues of biomedical courses.
Although there is wide information about Biomedical Informatics education and courses in different Websites, information is usually not exhaustive and difficult to update. We propose a new methodology based on information retrieval techniques for extracting, indexing and retrieving automatically information about educational offers. A web application has been developed to make available such in...
متن کاملA Short Survey of Biomedical Relation Extraction Techniques
Biomedical information is growing rapidly in the recent years and retrieving useful data through information extraction system is getting more attention. In the current research, we focus on different aspects of relation extraction techniques in biomedical domain and briefly describe the state-of-the-art for relation extraction between a variety of biological elements.
متن کاملAcquiring, Storing and Retrieving Diverse Biomedical Data Using the World-Wide-Web: The SenseLab Paradigm
The complexity of biomedical data creates unique challenges in their acquisition, storage and retrieval. Recent advances in the world-wide-web, database software and database-to-web middleware combined with the acceptance and use of the Internet in the scientific community create a strong framework to face these challenges. We describe SenseLab as a paradigm of a project that integrates the fac...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010